Learning with Value-Ramp

نویسندگان

  • Tom J. Ameloot
  • Jan Van den Bussche
چکیده

We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning and process improvement during production ramp-up

Rapid product lifecycles and high development costs pressure manufacturing "rms to cut not only their development times (time-to-market), but also the time to reach full capacity utilization (time-to-volume). The period between completion of development and full capacity utilization is known as production ramp-up. During that time, the new production process is ill understood, which causes low ...

متن کامل

Analysis of an Adaptive Iterative Learning Algorithm for Freeway Ramp Flow Imputation

We present an adaptive iterative learning based flow imputation algorithm, to estimate missing flow profiles in on ramps and off ramps using a freeway traffic flow model. We use the LinkNode Cell transmission model to describe the traffic state evolution in freeways, with on ramp demand profiles and off ramp split ratios (which are derived from flows) as inputs. The model based imputation algor...

متن کامل

Ramp loss linear programming support vector machine

The ramp loss is a robust but non-convex loss for classification. Compared with other non-convex losses, a local minimum of the ramp loss can be effectively found. The effectiveness of local search comes from the piecewise linearity of the ramp loss. Motivated by the fact that the `1-penalty is piecewise linear as well, the `1-penalty is applied for the ramp loss, resulting in a ramp loss linea...

متن کامل

Structured Ramp Loss Minimization for Machine Translation

This paper seeks to close the gap between training algorithms used in statistical machine translation and machine learning, specifically the framework of empirical risk minimization. We review well-known algorithms, arguing that they do not optimize the loss functions they are assumed to optimize when applied to machine translation. Instead, most have implicit connections to particular forms of...

متن کامل

The Use of Cooperative Approach in Ramp Metering

To ensure higher Level of Service (LoS) at urban motorways, new traffic control concepts are being applied since in most cases there is no available space for infrastructural build-up. For urban motorways, the mostly used control methods are ramp metering combined with additional control methods like variable speed limit control (VSLC). This paper gives a review of the current ramp metering app...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1608.03647  شماره 

صفحات  -

تاریخ انتشار 2016